Lin-Kernighan Heuristic Adaptations for the Generalized 
Traveling Salesman Problem 



D. Karapetyan'''*, G. Gutin'' 

"Royal HoUoway London University, Egham, Surrey, TW20 OEX, United Kingdom 



Abstract 

The Lin-Kernighan heuristic is known to be one of the most successful heuristics for the 
TraveUng Salesman Problem (TSP). It has also proven its efficiency in application to 
some other problems. 

In this paper we discuss possible adaptations of TSP heuristics for the Generalized 
Traveling Salesman Problem (GTSP) and focus on the case of the Lin-Kernighan al- 
gorithm. At first, we provide an easy-to-understand description of the original Lin- 
Kernighan heuristic. Then we propose several adaptations, both trivial and complicated. 
Finally, we conduct a fair competition between all the variations of the Lin-Kernighan 
adaptation and some other GTSP heuristics. 

It appears that our adaptation of the Lin-Kernighan algorithm for the GTSP reproduces 
the success of the original heuristic. Different variations of our adaptation outperform all 
other heuristics in a wide range of trade-offs between solution quality and running time, 
making Lin-Kernighan the state-of-the-art GTSP local search. 

Keywords: Heuristics, Lin-Kernighan, Generalized Traveling Salesman Problem, 
Combinatorial Optimization. 



1. Introduction 

One of the most successful heuristic algorithms for the f amous Trav eling Salesman 
Problem (TSP) known so far is the Lin-Kernighan heuristic ( Lin and Kernig han. 19731). 



It was propo sed almost forty years ago bu t even nowadays it is the state-of-the-art TSP 



local search jJohnson and McGeochl 120021) 



In this paper we attempt to reproduce the success of the original TSP Lin-Kernighan 
heuristic for the Generalized Traveling Salesman Problem (GTSP), which is an important 
extension of TSP. In the TSP, we are given a set V of n vertices and weights w{x y) 
of moving from a vertex a; G ^ to a vertex y ^ V. A feasible solution, or a tour, is a 
cycle visiting every vertex in V exactly once. In the GTSP, we are given a set V of n 
vertices, weights w{x ^ y) of moving from x £ V to y G V and a partition of V into m 
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nonempty clusters Ci , C2 , . . . , C,„ such that Ci OCj =0 for each i ^ j and |J,- Ci = V. 
A feasible solution, or a tour, is a cycle visiting exactly one vertex in every cluster. The 
objective of both TSP and GTSP is to find the shortest tour. 

If the weight matrix is symmetric, i.e., w{x — > y) = w{y — ^ x) for any x,y G V, the 
problem is called symmetric. Otherwise it is an asymmetric GTSP. In what follows, the 
number of vertices in cluster C; is denoted as \Ci\, the size of the largest cluster is s, and 
Cluster{x) is the cluster containing a vertex x. The weight function w can be used for 
edges, paths w{xi X2 ^ ■ ■ ■ ^ Xk) = w{xi — >■ X2) + 'w{x2 ^3) + . . . + w{xk-i — >■ 
Xk), and cycles. 

Since Lin-Kernighan is designed for the symmetric problem, we do not consider the 
asymmetric GTSP in this research. However, some of the algorithms proposed in this 
paper are naturally suited for both symmetric and asymmetric cases. 

Observe that the TSP is a special case of the GTSP when \Ci\ — 1 for each i and, 
hence, the GTSP is NP-hard. The GTSP has a host of applications in warehouse order 
picking with multiple stock locations, sequencing computer files, postal routing, airport 



selection and routing for courier planes and some others, see, e.g., dFischetti et al.Lll995 



1997 ; Laporte et al. , 19961 : Noon and Bean , 1991 ) and references therein. 



A lot of attention was paid in the literature to solving the GTSP. Sev eral researchers 
dBen-Arieh et all 120031: iLaporte and Semetl Il999t l iNoon and Beanlll993h proposed trans- 
formations of the GTSP into the TSP. At first glance, the idea to transform a little-studied 
problem into a well-known one seems to be natural; however, this approach has a very lim- 
ited application. On the one hand, it requires exact solutions of the obtained TSP instances 
because even a near-optimal solution of such TSP may correspond to an infeasible GTSP 
solution. On the other hand, the produced TSP instances have quite an unusual structure 
which is difficult for the existing solvers. A mor e efficient way to solv e the GTSP ex- 
actly is a branch-and-bound algorithm designed by lFischetti et al.l (Il997h . This algorithm 
was able to solve instances with up to 89 clusters. Two approximation algorithms were 
proposed in the literature, but both of them are unsuitable for the general case of the prob- 
lem, andjhe_juaranti£d^ quality is unreasonably low for real-world applications, 
see ( Bontoux et al. , 2010l) and references therein. 

In order to obtain good (i.e., not necessarily exact) solutions for larger GTSP in- 
stances, one should use th e heuristic approach. Several construction heuristics and local 
searches were discussed in ( Bontoux et all 2010l: Gutin and Karapetvan , 2010l : Hu and Raidl , 
2008 : Renaud and Boctor , ll998l; ISnvder and Das kin', '20061) and some others. A number 



of metaheuristics were proposed by Bontoux et al.i (i201Q)TGutin and Karapetvanl(l2010h: 



Gutin et al.l(E008l).lHuang et al.l(l20oi^ 



Pintea et al , 

Snyder and DaskinI (l2006l) : lTasgetiren et all (l2007l) : lYang et all (l2008l) 



(l2007h:ISilberholz and GoldenI (l2007h : 



In this paper we thoroughly discuss possible adaptations of a TSP heuristic for the 
GTSP and focus on the Lin-Kernighan algorithm. The idea of the Lin-Kernighan algo- 



rithm was already successfully appl ied to the Multidimensional Assignment Problem jBalas and Saltzman , 



199ll;lKarapetvan and Gutin , 2010l) . A straightforward adaptation for the GTSP was pro- 
posed by iHu and Raid ll (l2008hT their algorithm constructs a set of TSP i nstanc es and 
solves all of them with the TSP Lin-Kernighan heuristic. Bontoux et al. ( 2010l) apply 
the TSP Lin-Kernighan heuristic to the TSP tours induced by the GTSP tours. It will be 
shown in Section[3]that both of these approaches are relatively weak. 

The Lin-Kernighan heuristic is a sophisticated algorithm adjusted specifically for the 
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TSP. The explanation provided by iLin and KernighanI (1 1973b is full of details which 
complicate understanding of the main idea of the method. We start our paper from a clear 
explanation of a simplified TSP Lin-Kernighan heuristic (Section|2]i and then propose sev- 
eral adaptations of the heuristic for the GTSP (Section[3]l. In SectionlH we provide results 
of a thorough experimental evaluation of all the proposed Lin-Kernighan adaptations and 
discuss the success of our approach in comparison to other GTSP heuristics. In Section|5] 
we discuss the outcomes of the conducted research and select the state-of-the-art GTSP 
local searches. 



2. The TSP Lin-Kernighan Heuristic 

In this section we describe the TSP Lin-Ke rnighan heuristic (LKisp). It is a simpi 



fled version of the original algorithm. Note that jLin and KernighanI 1 1973h was published 
almost 40 years ago, when modest computer resources, obviously, influenced the algo- 
rithm design, hiding the main idea behind the technical details. Also note that, back then, 
the 'goto' operator was widely used; this affects the original algorithm description. In 
contrast, our interpretation of the algorithm is easy to understand and implement. 

LKtsp is a generaUzation of the fc-opt local search. The fc-opt neighborhood Nk.opi{T) 
includes all the TSP tours which can be obtained by removing k edges from the origi- 
nal tour T and adding k different edges such that the resulting tour is feasible. Observe 
that exploring the whole Nk-opt{T) takes 0{n^) operati ons and, thus, with a few ex 



ceptions, only 2-opt and rarely 3-opt are used in practice jJohnson and McGeochl I2002t 



Rego and Gloveii 120061) 



Similarly to fc-opt, LKtsp tries to remove and insert edges in the tour but it explores 
only some parts of the fc-opt neighborhood that deem to be the most promising. Consider 
removing an edge from a tour; this produces a path. Rearrange this path to minimize its 
weight. To close up the tour we only need to add one edge. Since we did not consider 
this edge during the path optimization, it is likely that its weight is neither minimized nor 
maximized. Hence, the weight of the whole tour is probably reduced together with the 
weight of the path. Here is a general scheme of LKtsp: 

1 . Let T be the original tour. 

2. For every edge e — > 6 G T do the following: 

(a) Let P = h ^ . . . ^ ehe the path obtained from T by removing the edge 

e ^ b. 

(b) Rearrange P to minimize its weight. Every time an improvement is found 
during this optimization, try to close up the path P. If it leads to a tour shorter 
than T, save this tour as T and start the whole procedure again. 

(c) If no tour improvement was found, continue to the next edge (Step|2]l. 

In order to reduce the weight of the path, a local search is used as follows. On every 
move, it tries to break up the path into two parts, invert one of these parts, and then rejoin 
them (see Figure [U. In particular, the algorithm tries every edge x y and selects the 
one which maximizes the gain g = w{x y)— w{e — > x). If the maximum g is positive, 
the corresponding move is an improvement and the local search is applied again to the 
improved path. 
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(a) The original path. 




(b) The path after a local search move. 



Figure 1; An example of a local search move for a path improvement. The weight of the 
path is reduced by w{x y) — w{x — > e). 

Observe that this algorithm tries only the best improvement and skips the other ones. 
A natural enhancement of the heuristic would be to use a backtracking mechanism to 
try all the improvements. However, this would slow down the algorithm too much. A 
compromise is to use the backtracking only for the first a moves. This approach is imple- 
mented in a recursive function ImprovePath{P, depth, R), see Algorithm[T] 



Algorithm 1 ImprovePath{P, depth, R) recursive algorithm (LKjsp version). The func- 
tion either terminates after an improved tour is found or finishes normally with no profit. 

Require: The path P = fo —>...—> e, recursion depth depth and a set of restricted 
vertices R. 
if depth < a then 

for every edge x ^ y E P such that x ^ Rdo 

Calculate g = w{x — > y) — w{e — > x) (see Figure [Tbil. 
if g > Othen 

if the tour 6^...— >a-— !>e— >j/^>feisan improvement over the 
original one then 

Accept the produced tour and terminate, 
else 

ImprovePath{b .t e y, depth + 1, RU {x}). 

else 

Find the edge x ^ y which maximizes g = wlx — > y) — w{e — !• x). 
if 5 > then 

if the tour b^...^x^e^...^y^bisan improvement over the original 
one then 

Accept the produced tour and terminate, 
else 

return ImprovePath{h . . . ^ x ^ e ^ . . . ^ y, depth + 1, i? U {x}). 



ImprovePath{P,l, 0) takes 0(n" • depth^^^^) operations, where depth^^^^^ is the 
maximum depth of recursion achieved during the run. Hence, one should use only small 
values of backtracking depth a. 
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The algorithm presented above is a simpHfied Lin-Kernighan heuristic. Here is a Ust 
of major differences between the described algorithm and the original one. 



1. 



The original heuristic does not accept the first found tour improvement. It records it 
and continues optimizing the path in the hope of finding a better tour improvement. 
Note that it was reported by Helsgaun ( 2000l) that this complicates the algorithm 
but does not really improve its quahty. 

The original heuristic does not try all the n options when optimizing a path. It con- 
siders only the five shortest edges a; — > e in the non-decreasing order. This hugely 
reduces the running time and helps to find the best rather than the first improvement 
on the backtracking stage. However, this speed-up approach is known to be a weak 
point of the original implementation (lHelsgaunil2000tlJohnson and McGeochi 120021) 



Indeed, even if the edge a; y is long, the algorithm does not try to break it if the 
edge X — e is not in the list of five shortest edges to e. 

Note that looking for the closest vertices or clusters may be meaningless in the 
application to the GTSP. In our implementation, every edge x ^ y is considered. 
The original heuristic does not allow delet ing the previously added edges or adding 
the pr eviously deleted edges. It was noted dHelsgaun , l2000l: l iJohnson and McGeochi 
20021) that either of these restrictions is enough to prevent an infinite loop. In our 



implementation a previously deleted edge is allowed to be added again but ev- 
ery edge can be deleted only once. Our implementation also prevents some other 
moves; however, the experimental evaluation shows that this does not affect the 
performance of the heuristic. 

4. The original heuristic also considers some more sophisticated moves to produce a 
path from the tour. 

5. The original heuristic is, in fact, embedded into a metaheuristic which runs the 
optimization several times. There are several tricks related to the metaheuristic 
which are inapplicable to a single run. 

The worst case time complexity of the Lin-Kernighan heuristic seems to be unknown 
from the literature ( Helsgaum 20091) but we assume that it is exponential. Indeed, observe 



th at the number of itera tions of the fc-opt local search may be non-polynomi al for any 



k dChandra et al.L 1 19941) and that LKtsp is a modification of fc-opt. However, iHelsgaun 



(l2009h notes that such undesirable instances are very rare and normally LKtsp proceeds in 



a polynomial time. 



3. Adaptations of the Lin-Kernighan Heuristic for the GTSP 

It may seem that the GTSP is only a slight vari ation of the TSP. In partic ular, one 



may propose splitting the GTSP into two problems ( iRenaud and BoctoAll998h : solving 
the TSP induced by the given tour to find the cluster order, and finding the shortest cycle 
visiting the clusters according to the found order. We will show now that this approach is 
poor with regards to solution quality. Let iVxsp(T) be a set of tours which can be obtained 
from the tour T by reordering the vertices in T. Observe that one has to solve a TSP 
instance induced by T to find the best tour in Njsp{T). 

Let Nco{T) be a set of all the GTSP tours which visit the clusters in exactly the same 
order as in T. The size of the Nco{T) neighborhood is []" i \Ci\ e 0{s'^) but there 
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exists a polynomial algorithm (we call it Cluster Optimizatio n, CO) which finds the best 



tour in NcoiT) in 0{ms'^) operations jFischetti et al.Lll997l) . Moreover, it requires only 



0{ms^ ■ mini time, i.e., if the instance has at least one cluster of size 0(1), CO 
proceeds in O(ms^). (Recall that s is the size of the largest cluster: s = max^ \Ci\.) 

The following theorem shows that splitting the GTSP into two problems (local search 
in A'tsp(7') and then local search in Nco{T)) does not guarantee any solution quality. 

Theorem 1. The best tour among NcoiT)UNTSpiT) can be a longest GTSP tour different 
from a shortest one. 

Proof. Consider the GTSP instance G in Figure |2a] It is a symmetric GTSP containing 
5 clusters {1}, {2, 2'}, {3}, {4} and {5}. The weights of the edges not displayed in the 
graph are as follows: w{l 3) = w{l — > 4) = and w{2 — > 5) = w{2' — > 5) = 1. 

Observe that the tour r = l— !>2— !>3^4— >5— shown in Figure|2b] is a local 
minimum in both Nco{T) and Nt:s,p{T). The dashed line shows the second solution in 
Nco{T) but it gives the same objective value. It is also clear that T is a local minimum 
in Ntsp{T). Indeed, all the edges incident to the vertex 2 are of weight 1, and, hence, any 
tour through the vertex 2 is at least of weight 2. 

The tour T is in fact a longest tour in G. Observe that all nonzero edges in G are 
incident to the vertices 2 and 2'. Since only one of these vertices can be visited by a tour, 
at most two nonzero edges can be included into a tour Hence, the weight of the worst 
tour in G is 2. 

However, there exists a better GTSP tour Topt = 1^2'^4^3^5^1of 
weight 1, see Figure [2a] 
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(a) The instance G and the optimal GTSP tour (b) A local minimum T which is the worst pos- 
Topt. sible GTSP tour 

Figure 2: An example of a local minimum in both N-ys?^) and Nco{T) which is a 
longest possible GTSP tour. 

□ 

In fact, the TSP and the GTSP behave quite differently during optimization. Observe 
that there exists no way to find out quickly if some modification of the cluster order im- 
proves the tour Indeed, choosing wrong vertices within clusters may lead to an arbitrary 
large increase of the tour weight. And since a replacement of a vertex within one cluster 
may require a replacement of vertices in the neighbor clusters, any local change influences 
the whole tour in general case. 

3.1. Local Search Adaptation 

A typical local search with the neighborhood iV(T) performs as follows: 
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Require: The original solution T. 
for all T' e N{T) do 
if w(r') < w{T) then 
T -h- v. 

Run the whole algorithm again, 
return T. 

Let Ni{T) C iVTSp(T) be a neighborhood of some TSP local search LSi{T). Let 
7V2(r) C Nco{T) be a neighborhood of some GTSP local search LS2{T) which leaves 
the cluster order fixed. Then one can think of the following two adaptations of a TSP local 
search for the GTSP: 

(i) Enumerate all solutions T' G Ni(T). For every candidate T' run T' ^ LS2{T') to 
optimize it in N2{T'). 

(ii) Enumerate all solutions T' e N2iT). For every candidate T' run T' ^ LSi{T') to 
optimize it in Ni{T'). 

Observe that the TSP neighborhood A'^i (T) is normally harder to explore than the clus- 
ter optimization neighborhood N2iT). Consider, e.g., A^i(r) = Ntsp{T) and A^2(T) = 
Nco{T). Then both options yield an optimal GTSP solution but Option ^ requires 
0{m\ms'^) operations while Option ^ requires 0(s"'to!) operations. 

Moreover, many practical applications of the GTSP have some localization of clusters, 
i.e., \w{x — >■ yi) — w{x j/2)| ^ w{x — > yi) on average, where Cluster{yi) = 
Cluster {y2) ^ Cluster (x). Hence, the landscape of N2{T) depends on the cluster order 
more than the landscape of A^i (T) depends on the vertex selection. From above it follows 
that Option ^ is preferable. 



Option ^ was used by IHu and Raidll ( l2008h as follows. The cluster optimization 



neighborhood N2 (T) includes there all the tours which differ from T in exactly one ver- 
tex. For every T' G N2{T) the Lin-Kernighan heuristic was applied. This results in n 
runs of the Lin-Kernighan heuristic which makes the algorithm unreasonably slow. 

Option ^ may be implemented as follows: 
Require: The original tour T. 
for all r e iVi(r) do 

T' QuickImprove{T'). 
if w(T') < w{T) then 
T <— SlowImprove{T'). 
Run the whole algorithm again, 
return T. 

Here QuickImprove{T) and SlowImprove{T) are some tour improvement heuristics 
which leave the cluster order unchanged. Formally, these heuristics should meet the fol- 
lowing requirements: 

• QuickImprove{T) , SlowImprove{T) G Nco{T) for any tour T; 

• w{QuickImprove{T)) < w{T) axvA w{SlowImprove{T)) < 'w{T) for any tour T. 

Quicklmprove is applied to every candidate T' before its evaluation. Slowlmprove is 
only applied to successful candidates in order to further improve them. One can think of 
the following improvement functions: 
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• Trivial I{T) which leaves the solution without any change: I{T) = T. 



Full optimization CO{T) which applies the CO algorithm to the given solution. 



• Local optimization L{T). It updates the vertices only within clusters, affected by 
the latest solution change. E.g., if a tour xi X2 ^ x^^ ^ X4 — > xi was changed 
to zi -> 2-3 — > X2 — > X4 — !> xi, some implementation of L{T) will try every 



Xl 



Xi 



Xl, where G Cluster{x2) and £ Cluster{x3). 



There are five meaningful combinations of Quicklmprove and Slowlmprove: 

1. Quicklmprove (T) = I{T) and Slowlmprove (T) = I{T). This actually yields the 
original TSP local search. 

2. Quicklmprove (T) = I{T) and Slowlmprove (T) — CO{T), i.e., the algorithm 
explores the TSP neighborhood but every time an improvement is found, the solu- 
tion T is optimized in Nco{T). One can also consider SlowImprove{T) = L{T), 
but it has no practical interest. Indeed, Slowlmprove is used quite rarely and so its 
impact on the total running time is negligible. At the same time, COiT) is much 
better than L{T) with respect to solution quality. 

3. Quicklmprove (T) = L{T) and SlowImprove{T) = I{T), i.e., every solution 
T' S N(T) is improved locally before it is compared to the original solution. 

4. Quicklmprove (T) ~ L{T) and SlowImproveiT) = CO{T), which is the same 
as Option[3]but it additionally optimizes the solution T' globally in Nco{T') every 
time an improvement is found. 

5. Quicklmprove (T) = CO{T) and SlowImprove{T) = I{T), i.e., every candidate 
T' G N{T) is optimized globally in Nco{T') before it is compared to the original 
solution T. 

These adapt ations were widely applie d in the literature. For example, the heuris- 
tics G2 and G3 jRenaud and Boctor , 1998 ) are actually 2-opt and 3-opt adapted accord- 
ing to Option |5] An im provement over the n aive implementation of 2-opt adapted in 
this way is proposed bv IHu and Raidll (120081) : asymptotically, it is faster by factor 3. 
However, this approach is still too slow. Adaptations of 2-opt a nd some other heuris- 
tics ac c ording to Option [3] we r e used by Fischetti et alj ( 19971). Gutin and K arapetvari 



( 201c ), ISilberholz and GoldenI (|2007|) . [Snvd er and DaskinI ( I2OO6I) . and iTasgetiren et al 
(l2007h . Some unadapted TSP loca l searches (Option [T|) were us ed bv Bontoux et al. 



( 201c ), Gutin and Karapetvan ( 2010l) . Silberholz and Golden ( 2007 ), and Snvder and Daskin 
(2006). 



3.2. Adaptation of LKtsp 

In this section we present our adaptation LK of LKtsp for the GTSP. A pseudo-code 
of the whole heuristic is presented in Algorithmic Some of its details are encapsulated 
into the following functions (note that LKtsp is not a typical local search based on some 
neighborhood and, thus, the framework presented above cannot be applied to it straight- 
forwardly): 

• Gain{P, a; — > ?/) is intended to calculate the gain of breaking a path P at an edge 
X ^y. 
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Algorithm 2 LK general implementation 



Require: The original tour T. 

Initialize the number of idle iterations t <— 0. 
while i < m do 

Cyclically select the next edge e b eT. 

Let Po = h ^ . . . ^ ehe the path obtained from T by removing the edge e — >■ 6. 
Run T' <— ImprovePath{Po , 1, 0) (see below), 
if w(r') < w{T) then 

Set T = ImproveTour{T'). 

Reset the number of idle iterations i ^ 0. 
else 

Increase the number of idle iterations t ^ i + 1 . 



Procedure ImprovePath{P, depth, R) 

Require: The path P = 5 — > . . . e, recursion depth depth and the set of restricted 
vertices R. 
if depth > a then 

Find the edge x-^yEP, xj^b, x^R such that it maximizes the path gain 
Gain{P, x ^ y). 
else 

Repeat the rest of the procedure for every edge x-^yEP,x^b, x^R. 

Conduct the local search move: P <— RearrangePath{P, x —5- y). 
if GainIsAcceptable{P, x y) then 
Replace the edge x y with a; e in P. 
T' = CloseUp{P). 
itw{T') > w(T) then 

Run T' 4- ImprovePath{P, depth + 1, i? U {x}). 
ifw(r') < w(T) then 

return P'. 
else 

Restore the path P. 
return P. 



• RearrangePath{P, x — > y) removes an edge x ^ y from a path P and adds 
the edge x e, where P = 6^...— >a;— ^y— ^...^-e, see Figure [T] 
Together with CloseUp, it includes an implementation of QuickImprove{T) (see 
Section ITTT i. so RearrangePath may also apply some cluster optimization. 

• GainIsAcceptable{P, x y) determines if the gain of breaking a path P at an 
edge X — y is worth any further effort. 

• CloseUp{P) adds an edge to a path P to produce a feasible tour. Together with 
RearrangePath, it includes an implementation of QuickImprove{T) (see Sec- 
tion l3.1l l, so CloseUp may also apply some cluster optimization. 
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• ImproveTour{T) is a tour improvement function. It is an analogue to SlowImprove{T) 
(see Section lTTI ). 

These functions are the key points in the adaptation of LKtsp for the GTSP. They 
determine the behaviour of the heuristic. In Sections l331l3.4l and l3.5l we describe different 
implementations of these functions. 

3.3. The Basic Variation 

The Basic variation of LKtsp (in what follows denoted by B) is a trivial adapta- 
tion of LK according to Option [T] (see Section 13.11 ). It defines the functions Gain, 
RearrangePath, CloseUp and ImproveTour as follows: 

Gain^{b — > . . . — > e, x ^ y) = w{x — w{e — >■ x) , 

RearrangePath^ {b^...^x^y^...^e, x^y) = b^...^x^e^...-^y, 

Close Upg {b ^ . . . ^ e) = b ^ . . . ^ e ^ b , 

and ImproveTour ^{T) is trivial. We also consider a B™ variation (Option |2l) which 
appUes CO every time an improvement is found: ImproveTour (T) — CO{T). 

The implementation of GainIsAcceptable{G , P) will be discussed in Section [331 



3.4. The Closest and the Shortest Variations 

The Closest and the Shortest variations (denoted as C and S, respectively) are 
two adaptations of LKtsp according to Option [3] i.e., QuickImprove{T) = L{T) and 
SlowImprove{T) = I{T). In other words, a local cluster optimization is applied to every 
candidate during the path optimization. 

Consider an iteration of the path improvement heuristic ImprovePath. Let the path 
P = 6— >j/^-...^ebe broken at the edge x ^ y (see Figure [3]). Then, 




Figure 3: Path optimization. 



to calculate Gain{P, x ^ y) in C, we replace x E X with x' E X such that the edge 
a; — !• e is minimized: 

Gainc{b — > . . . ^> p — > .t — > y ^> . . . ^ e, x ^ y) 

= w{p — > X — > y) — w{p x' ^ e) , 

where x' G Cluster{x) is chosen to minimize w{x' e). 

In S, we update both x and e such that the path ?>e— ?>ris minimized: 

Gain^{b — >y— )'...^>r— >e, x ^ y) = 

w{p X ^ y) + w{r — >■ e) — w{p — !> x' — > e' — >■ r) , 
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where x' £ Cluster [x] and e' G Cluster [e] are chosen to minimize w{p x' e' ^ 
r). 

Observe that the most time-consuming part of LK is the path optimization. In case of 
the S variation, the bottleneck is the gain evaluation function which takes O(s^) opera- 
tions. In order to reduce the number of gain evaluations in S, we do not consider some 
edges a; — > ?/. In particular, we assume that the improvement is usually not larger than 

Wmin{X, Y) — Wniin{X, E), where X ~ Cluster{x), Y — Cluster{y), E = Cluster{e) 
and WnuniA^B) is the weight of the shortest edge between some clusters A and B: 
Wm\n{A,B) = min w(a — > 6) . Obviously, all the values Wn-iin(A, B) are precal- 

aeA,beB 

culated. Note that this speed-up heuristic is used only when depth > a, see Algorithmic 
One can hardly speed up the Gain function in B or C. 

The RearrangePath function does some further cluster optimization in the C varia- 
tion: 

RearrangePathQ^b >p—>a;—>j/—>...—>e, x ^ y) 

= 6— !>...— !-p—>a;'—>e— >...—>?;, 

where x' G Cluster{x) is chosen to minimize the weight w{j) x' ^ e). In S it just 
repeats the optimization performed for the Cain evaluation: 

RearrangePath^{b — >r—>-e, x y) 

= &—>...—^p—>a;'—>e'~>7'— >...—>?/, 

where x' e Cluster{x) and e' G Cluster{e) are chosen to minimize w{p — !• x' -> e' — 
r). 

Every time we want to close up the path, both C and S try all the combinations of the 
end vertices to minimize the weight of the loop: 

Close Upq ^{b ^ p ^ . . . ^ q ^ e) = b' ^ p ^ . . . ^ q ^ e ^ b' : 

b' G Cluster{b), e' G Cluster{e) and w{q — > e' — !■ 6' — s- p) is minimized . 

We also implemented the C^" and S*^" variations such that CO is applied every time a 
tour improvement is found (see Option]?] above): ImproveTour{T) = CO{T). 

3.5. The Exact Variation 

Finally we propose the Exact (E) variation. For every cluster ordering under consid- 
eration it finds the shortest path from the first to the last cluster (via all clusters in that 
order). After closing up the path it always applies CO (see Option]5]above). However, it 
explores the neighborhood much faster than a naive implementation would do. 

The Gain function for E is defined as follows: 

Cain^[b — >...—>a;—>-2;— e, x ^ y) ~ 

Wco{b — >...— >x^e^...^'?/)— Wco{b — i-...—5'a::—;>2y— 5- e), 
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where WmiP) is the weight of the shortest path through the corresponding clusters: 

Wco{xi ^ X2 ^ ■ . ■ ^ x„i) = min w{x[ X2 ^ ■ ■ . ^ x'^) . 

x'^ ^ Cluster{x i) , . . . ,m 

Note that ImprovePath runs this function sequentially for every a; — > y G P. In case of 
a naive implementation, it would take 0{'m?s^) operations. Our implementation requires 
only 0{ms^) operations but in practice it is much faster (almost 0{ms^)). Also note that 
typically m ^ s. 

Our implementation proceeds as follows. Let Xi, X2, . . . , Xm be the sequence of 
clusters in the given path (see Figure l4ab. Let be the length of the shortest path from 



V e X, 
X, 



ly 



qex 



i+1 



X 



e G X.„ 

Xri 



(a) The original sequence of clusters Xi , X2 , 



Xr] 



The value l^, denotes the shortest path 



from the cluster Xi through X2, X-^, . . ., Xj — i to the vertex v S Xj. It takes 0{\Xj-i\\Xj\) 
operations to calculate all ly for some j. Value denotes the shortest path from the vertex e G X,n 
through Xjn-i, Xm-2, to the vertex q 6 Xj+i. It takes 0{\Xm\\Xj-t-2\\Xj+i\) 

operations to calculate all for some j. 



veXj 



ly 



q e Xj+1 



ueXt 



e€ X„ 

Xr, 



■ Xrr 



Xi 



(b) An improved algorithm. Let cluster Xt be the smallest cluster among Xjj^2, ^j+z. 
To calculate all the shortest paths from u £ Xt to g G ^j+l Xt-i, Xt—2, ■ ■ 
one needs 0{\Xt \ \Xjj^2 1 l^j+i I) operations for some j, i.e., it is | times faster than the 

straightforward calculations. The values are calculated as previously, see Figure [(a)] 



j+2. 



ly 




(c) The sequence of clusters after the local search move. To find the shortest path from Xi to Xj^i 
via X2, X:j, Xj, Xm, Xm—i, ■■, ^3+2, we need to find all the shortest paths from Xi 
to every e 6 Xm as l'^ = min^{Z„ + w{v — ^ e)} in O(s^) operations, then find all the shortest 
paths l'^ from Xi to every u S Xt as l'^ = raine{l'^ + l^} in O(s^) operations and, finally, find 
the whole shortest path I' from Xi to Xj^i as I' = icnmu,q{l'y + } in O(s^) operations. 

Figure 4: A straightforward and an enhanced implementations of the E variation. 



Xi to V G Xj through the cluster sequence X2, X3, .. ., Xj^i. 

ly = min w{xi — > a;2 — > • . • — > Xj-i v) . 

Xi£Xi,i—l,...,j — l 
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It takes Oi^s^rn) operations to calculate all Z„ using the algorithm for the shortest path in 
layered networks. 

Let be the length of the shortest path from e G to g G -'^j+i through the cluster 
sequence X,„_2, ■ ■ ■ , 

It = min w{e ^ Xm-i ^ .■e,-+2 q) ■ 

Xi^Xi,i—j-{-2,...,m—l 

It takes 0{s^m) operations to calculate all using the algorithm for the shortest path in 
layered networks. 

As a further improvement, we propose an algorithm to calculate which also takes 
O(s^m) operations in the worst case but in practice it proceeds significantly faster. 

Note that a disadvantage of a straightforward use of the shortest path algorithm to find 
is that its performance strongly depends on the size of ; indeed, the straightforward 
approach requires |Xm||-'i^j+2||-'^j"+i| operations for every j. Assume \Xt\ < \Xm\ for 
some t, j + 1 < t < m, and we know the values for every u Q Xt (see Figure |4b]i. 
Now for every j < t ~ 1 we only need to calculate 1^, where u € Xt and q G Xj^^-i. This 
will take operations for every j, i.e, it is |Xm|/|-Yt| times faster than the 

straightforward approach. A formal procedure is shown in Algorithm[3j 

Algorithm 3 Calculation of the shortest paths and for E. 

Require: The sequence of clusters Xi, X2, ■■ ■ , Xm- 
for every e G Xm and every q G ^m-i do 
^ w{e q). 

Y ^ X,n. 

for j m — 3, TO — 4, . . . , 1 do 
if |Xj+2| < |y| then 
if r 7^ X„ then 

for every e G X^ and every u G ^j+2 do 

^miny6i'{^y + ^^}- 
Y ^ X,+2. 
for every j/ G F and every q G X^+i do 



Having all and l^, where v G Xj, q G Xj^i, e G X„l and u G Xt, j + 1 < t < 
TO, one can find the shortest path through all the clusters Xi, X2, ■ ■ ■ , Xj, Xm, Xm-i, 
... , Xj+i in O(s^) time, see Algorithm |4] and Figure l4cl 

In our experiments this speed-up heuristic decreased the running time of the E algo- 
rithm by 30% to 50%. 

The RearrangePath function for E replaces the edge x ^ y with a; — > e and opti- 
mizes the vertices in the path: 

RearrangePath^{b .x —!> y ^> ...—>■ e) = 6' a;' j/' —)■...—;> e' , 

where all the vertices are selected to minimize the weight of the resulting path. The 
Close Up function for E simply applies CO to the tour: 

CloseUp^{h ...^e) = CO{b . . . ^ e ^ b) . 
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Algorithm 4 Calculation of the whole shortest path for E. 



Require: The index j. 

Require: The values ly, and 1^, where v E Xj, q G Xj+i, e G Xm and u G Xt, 

j + 1 < t < m. 

Calculate l^. <— minugx^ ly + w{v e) for every e G Xm- 
if t < TO then 

Calculate ly <— min^^Xm. + It every u E Xt. 

Calculate Iq -s— niin^gjc^ Z„ + for every q G -'^^j+i- 
else 

Calculate Iq ^ niiiiegjc^ Ze + Iq for every q G Xj+i. 
return inin,gXj+i 



Observe that, unlike other adaptations of the original LKtsp heuristic. Exact is natu- 
rally suitable for asymmetric instances. 



Note that another approach to implement the CO algorithm is proposed by Pop (l2007h 



It is b ased on an integer formulation of the GTSP; a more general case is studied in (IPop et al 



2006h . However, we believe that the dynamic programming approach enhanced by the 



improvements discussed above is more efficient in our case. 

3.6. The Gain Function 

The gain is a measure of a path improvement. It is used to find the best path improve- 
ment and to decide whether this improvement should be accepted. To decide this, we 
use a boolean function GainIsAcceptable{P, x y). This function greatly influences 
the performance of the whole algorithm. We propose four different implementations of 
Gainis Acceptable (P, x y) in order to find the most efficient ones. For the notation, 
see Algorithm|2] 

1. GainIsAcceptable{P, x — > y) = w{P) < w{Po), i.e., the function accepts any 
changes while the path is shorter than the original one. 

2. Gainis Acceptable{P, x — > y) = it;(P) + ^^^^ < w{T), i.e., it is assumed that an 
edge of an average weight ^^^J will close up the path. 

3. GainIsAceeptable{P, x ^ y) ^ w{P) + w{x y) < w{T), i.e., the last 
removed edge is 'restored' for the gain evaluation. Note that the weight of the edge 
a; — 7> y cannot be obtained correctly in E. Instead of w{x — > y) we use the weight 
Wmin{X, Y) of the shortest edge between X = Cluster(x) and Y = Cluster{y). 

4. GainIsAcceptable{P, a: — ?► y) = ■w{P) < w{T), i.e., the obtained path has to 
be shorter than the original tour In other words, the weight of the 'close up edge' 
is assumed to be 0. Unlike the first three implementations, this one is optimistic 
and, hence, yields deeper search trees. This takes more time but also improves the 
solution quality. 

5. GainIsAcceptable{P, a; — 5- y) = 'w{P) + ^^'^ < w{T), i.e., it is assumed that 
an edge of a half of an average weight will close up the path. It is a mixture of 
Options |2] and m 
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4, Experiments 



In order to select the most successful variations of the proposed heuristic and to prove 
its efficiency, we conducted a set of computational experiments. 



Our test bed includes several TSP instances taken from TS PLIB (IReinelt , Il991h con 



verted into the GTSP by the standard clustering procedure of 



Fischetti et al 



(119971) rthe 



same approach is w idely use d in the hterature, s e e, e.g ., tautin and Karapetvanl 2010l; 
Silberholz and Goldeni, ,2007 ; Snyder and Daskirii 2006t Tasgetiren et al.. 2007) ) . Lik e 
Bontoux et al.l (I2OI0I) . iGutin and Karapetvanl (l2010t) . and ISilberholz and GoldenI (|2007|) . 
we do not consider any instances with less than 10 or more than 217 clusters (in other 
papers the bounds are stricter). 

Every instance name consists of three parts: 'm t n\ where m is the number of 
clusters, t is the type of the original TSP instance (see ( IReineltl Il99lh for details) and n 
is the number of vertices. 

Observe that the optimal sol utions are known only for some instances with up to 
89 clusters ( iFischetti et al lll997l). For the rest of the instances we use the best known 
solutions, see ( Bontoux et al. , 2010l: Gutin and Karapetvaru,. 20 10;. Silberholz and Golde"ni 
2007h. 



The following heuristics were included in the experiments: 

1. The Basic variations, i.e., and B^'=°, where a 6 {2, 3, 4} and x e {1, 2, 3, 4, 5} 
define the backtracking depth and the gain acceptance strategy, respectively. The 
letters 'co' in the superscript mean that the CO algorithm is applied every time a 
tour improvement is found (for details see Section lTTI i. 

TheClosestvariations,i.e., C;;!andC;j™, where a G {2, 3, 4} and .t e {1,2,3,4,5}. 
The Shortest variations, i.e., S2 and S2'°, where a £ {2, 3, 4} and x G {1,2,3,4,5}. 
The Exact variations, i.e., E", where a G {1, 2, 3} and x G {1, 2, 3, 4, 5}. 
Adaptations of the 2-opt (2o) and 3-opt (3o) local searches according to Sec- 
tion O 

A state-of-the-art memetic algorithm ma by Gutin and Karapetvan ( 2010l) . 



Observe that m a dominates all other GTSP metaheuristics known from the litera- 
tu re. In particu lar, 'Gutin and Karapetvan' ('2010') compare it to the heuristics propose d 
bv lSilberhoizan d Golden (2007), Snyder and Daskin (200l) and lTasgetiren et al.l (l2007h . 
and it appears that ma dominates all these algorithms in every experiment with respect 
to both solution quality a nd running ti me. Similarly, one can see that it do minates two 
more recent algorithms by Bontoux et al. ( 2010 ) and Tasgetiren et al. ( 2010l) in every ex- 
periment. Note that the running times of all these algorithms were normalized according 
to the computational platforms used to evaluate the algorithms. Hence, we do not include 
the results of the other metaheuristics in our comparison. 

In order to generate the starting tour for the local search procedures, we use a sim - 
plified Nearest Neighbour construction heuristic (NN). Unlike proposed bv lNoonI ( Il988h . 
our algorithm tries only one starting vertex. Trying every vertex as a starting one signif- 
icantly slows down the heuristic and usually does not improve the solutions of the local 
searches. Note that in what follows the running time of a local search includes the running 
time of the construction heuristic. 
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All the heuristics are implemented in Visual C++. The evaluation platform is based 
on an Intel Core i7 2.67 GHz processor 

The experimental results are presented in two forms. The first form is a fair compe- 
tition of all the heuristics joined in one table. The second form is a set of standard tables 
reporting solution quality and running time of the most successful heuristics. 

4.1. Heuristics Competition 

Many researchers face the problem of a fair comparison of several heuristics. Indeed, 
every experiment result consist of at least two parameters; solution error and running time. 
It is a trade-off between the speed and the quality, and both quick (and low-quality) and 
slow (and high-quality) heuristics are of interest. A heuristic should only be considered 
as useless if it is dominated by another heuristic, i.e., it is both slower and yields solutions 
of a lower quality. 

Hence, one can clearly separate a set of successful from a set of dominated heuristics. 
However, this only works for a single experiment. If the experiment is conducted for 
several test instances, the comparison becomes not obvious. Indeed, a heuristic may be 
successful in one experiment and unsuccessful in another one. A natural solution of this 
problem is to use averages but if the results vary a lot for different instances this approach 
may be incorrect. 

In a fair competition, one should compare heuristics which have similar running times. 
For every time Ti e {0.02 s, 0.05 s, 0.1 s, 0.2 s, . . . , 50 s} we compare solution quality of 
all the heuristics which were able to solve an instance in less than n. In order to further 
reduce the size of the table and to smooth out the experimental results, we additionally 
group similar instances together and report only the average values for each group. 

Moreover, we repeat every experiment 10 times. It requires some extra effort to ensure 
that an algorithm H proceeds differently in different runs, i.e., iJ' (/) ^ [I] in general 
case, where i and j are the run numbers. For ma'^ the run number r is the random 
generator seed value. In NN'', we start the tour construction from the vertex C^.i- i e., 
from the first vertex of the rth cluster of the instance. This also affects all the local 
searches since they start from the NN'' solutions. 

Finally we get Table[T] Roughly speaking, every cell of this table reports the most suc- 
cessful heuristics for a given range of instances and being given some limited time. More 
formally, let r = {ti , T2 , . . . } be a set of predefined time limits. Let I = {Xi , I2 , • • ■ } be 
a set of predefined instance groups such that all instances in every Ij have similar diffi- 
culty. Let 7^ be a set of all heuristics included in the competition. H{I)ii^^ and iJ(/)enoi 
are the running time and the relative solution error, respectively, of the heuristic H ^ H 
for the instance I E I: 

_ w{H{I)) - Z/;(/best) 

where /best is the optimal or the best known solution for the instance /. -ff(2j)time and 
iJ(X, )enoi denote the corresponding values averaged for all the instances / G Ij and all 
rG{l,2,...,10}. 

For every cell i, j we define a winner heuristic Winner i,j G "H as follows: 

1. Winnerl j{I)tx^s < Ti for every instance / G Ij and every r G {1, 2, . . . , 10}. 
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2. Winner i J {Xj)saor < Winneri^i^j{Ij)aTor (it is only applicable if i > 1). 

3. If several heuristics meet the conditions above, we choose the one with the smallest 

-^j,j(-^j)eiTor- 

4. If several heuristics meet the conditions above and have the same solution quality, 
we choose the one with the smallest Hi j (X, )time- 

Apart from the winner, every cell contains all the heuristics H E H meeting the 
following conditions: 

1. i/''(/)time < Ti for every instance / e Ij and every r E {1,2,..., 10}. 

2. i/(X,)eiTor < Winneri^i J {Ij)aTor (it is only applicable if i > 1). 

eiTor ■ 

4. fl"(I,)tinie < 1-2 • Winnen,j{Ij)nme- 

Since LK is a powerful heuristic, we did not consider any instances with less than 30 
clusters in this competition. Note that all the smaller instances are relatively easy to solve, 
e.g., ma was able to solve all of them to optimality in our experiments, and it took only 
about 30 ms on average, and for Sf^° it takes, on average, less than 0.5 ms to get 0.3% 
error, see Table [3] 

We use the following groups Ij of instances: 
Tiniest: 30chl50, 30kroA150, 30kroB150, 31prl52, 32ul59 and 39ratl95. 
Tiny: 40kroa200, 40krob200, 41gr202, 45ts225, 45tsp225 and 46pr226. 
Small: 46gr229, 53gil262, 56a280, 60pr299 and 641in318. 
Moderate: 80rd400, 84fl417, 87gr431, 88pr439 and 89pcb442. 
Large: 9 9d4 93, 107att5 32, 107ali5 35, 113pa5 61, 115u574 and 115rat575. 
Huge: 132d657, 134gr666, 145u724 and 157rat783. 
Giant: 200ds jlOOO, 201prl002, 212ul060 and 217vml084. 

Note that the instances 35sil75, 36brgl80, 40dl98, 53pr264, 107si535, 
131p65 4 and 207sil032 are excluded from this competition since they are signif- 
icantly harder to solve than the other instances of the corresponding groups. This is 
discussed in Section|4|2]and the results for these instances are included in Tables |4]and|5] 

One can see from Table[T]that there is a clear tendency: the proposed Lin-Kernighan 
adaptation outperforms all the other heuristics in a wide range of trade-offs between solu- 
tion quality and running time. Only the state-of-the-art memetic algorithm ma is able to 
beat LK being given large time. There are several occurrences of 2-opt in the upper right 
corner (i.e., for Huge and Giant instances and less than 5 ms time) but this is because this 
time is too small for even the most basic variations of LK. Note that 2o^ and 20b° denote 
the 2-opt local search adapted for the GTSP according to Options [T| and |2] respectively, 
see Section im 

Clearly, the most important parameter of LK is its variation, and each of the four 
variations (Basic, Closest, Shortest and Exact) is successful in a certain running time 
range. B wins the competition for small running times. For the middle range of running 
times one should choose C or S. The E variation wins only in a small range of times; 
having more time, one should choose the memetic algorithm ma. 

Here are some tendencies with regards to the rest of the LK parameters: 

• It is usually beneficial to apply CO every time a tour improvement is found. 
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Table 1: The fair competition. Every cell reports the most successful heuristics being 
given some limited time (see the first column) for a given range of instances (see the 
header). Every heuristic is provided with the average relative solution error in percent. 
To make the table easier to read, all the B and E adaptations of LK are selected with bold 
font. All the cells where the dominating heuristic is C or S are highlighted with grey 
background. 



Tiniest Tiny Small Moderate Large Huge Giant 

c2co , , ^f" 6.1 

<2ms Sf° 1.2 Cl"^" 1.0 Cf ° 3.5 pio ^'^ Bf ° 7.8 2o™13.4 20b22.7 

^5 Bf» 6.5 

















Bf« 9.5 




< 5 ms 


S^'^" 0.0 


C^"^" 0.5 


C^" 1.2 
Sj"^" 1.2 


C^'^" 2.4 


B4co 
B2C0 


7.2 
7.3 


Bf " 9.6 
B^^lO.l 
B^™10.3 


2o™14.3 



<10ms — — C^™ 0.8 Cj"-" 1.3 Cl'=° 2.9 To Bf ° 7.' 







Cf" 0.8 


1.3 


Cf° 2.9 


Cf° 


6.1 
6.3 


O3co n A 



<20ms — S''-" 0.5 o5co n ^ C^'-" 1.3 ° 2.4 C^"^" 4.0 

0-) U.J 



< 50 ms — 





q4 
^4 


0.2 




0.2 


1.1 


s? 

C2co 
^4 


2.2 
2.2 


g2co 
Q3co 


2.9 
3.0 




4.0 






S4CO 


0.2 


g4co 


0.0 




Cf° 


1.0 


Cf° 


1.7 


Sico 


3.0 



< 0.1 s 



< 0.2 s — — — 0.6 — — Sf° 1.9 
<0.5s — ma 0.0 — — — S^"^" 1.2 — 

< 1 s — — — 0.4 — 1.0 S^™ 1.2 

< 2 s — — — — — 1.0 — 

< 5 s — — — ma 0.0 E| 0.8 E^ 0.8 — 

< 10 s — — — — ma 0.0 — — 
<20s — — — — — ma 0.1 — 
<50s — — — — — — ma 0.2 



• The most successful gain acceptance options are|4]and|5](see Section lsTSI l. 

• The larger the backtracking depth a, the better the solutions. However, it is an 
expensive way to improve the solutions; one should normally keep a G {2, 3, 4}. 

Table [H however, does not make it clear what parameters one should use in practice. 
In order to give some advice, we calculated the distances d{H) between each heuristic 
H ^ T-L and the winner algorithms. For every column j of Table[T]we calculated dj{H): 

en"or 

^ Winnerij{Ij)en-oi- 
where i is minimized such that H^{I)ame < for every / e Ij and r e {1,2,..., 10}. 
Then dj {H) were averaged for all j to get the required distance: d{H) = dj (H). The list 
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of the heuristics H with the smallest distances d{H) is presented in Table |2] In fact, we 
added 20b°, B?™ and E| to this list only to fill the gaps. Every heuristic H in Table |2] is 
also provided with the average running time T{H), in % of ma running time: 



T{H) = T{H, /, r) is averaged for all the instances / e Z and all r e {1, 2, . . . , 10} 

^'^(^)time 



where T{H, /, r) 



MA{I\i 



and Af^(/)time = AL4''(/)time is averaged for all r 6 {1, 2, . . . , 10} , 



Table 2: The list of the most successful heuristics. The heuristics H are ordered according 
to their running times, from the fastest to the slowest ones. 20b° denotes the 2-opt local 
search adapted for the GTSP according to Option|2l see Section lTTl 



H 


d{H), % 


Time, % of ma time 


20- 


44 


0.04 


g2co 


34 


0.10 


Q2co 


12 


0.40 


S2co 


19 


0.97 


S3co 


19 


2.53 


g4co 


35 


8.70 


q4co 
^4 


32 


15.34 


■=4 


56 


43.62 


ma 





100.00 



4.2. Detailed Data For Selected Heuristics 

In this section we provide the detailed information on the experimental results for 
the most successful heuristics, see Section [43] Tables |3] H] and |5] include the following 
information: 

• The 'Instance' column contains the instance name as described above. 



• The 'Best' column contains the best known or optimal (iFischetti et al.Lll997h ob- 
jective values of the test instances. 

• The rest of the columns correspond to different heuristics and report either relative 
solution error or running time in milliseconds. Every value is averaged for ten runs, 
see Section|4T|for details. 



• The 'Average' row reports the averages for all the instances in the table. 

• The 'Light avg' row reports the averages for all the instances used in Section|4T| 

• Similarly, the 'Heavy avg' row reports the averages for all the instances (m > 30) 
excluded from the competition in Section l4~n 
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All the small instances (m < 30) are separated from the rest of the test bed to Table[3] 
One can see that all these instances are relatively easy to solve; in fact several heuristics 
are able to solve all or almost all of them to optimality in every run and it takes only a 
small fraction of a second. A useful observation is that solves all the instances with up 
to 20 clusters to optimality, and in this range E4 is significantly faster than ma. 

As regards the larger instances (m > 30), it is worth noting that there exist several 
'heavy' instances among them: 35sil75, 3 6brgl80, 4 0dl98, 5 3pr2 64, 107si535, 
131p65 4 and 207sil032. Some heuristics perform extremely slowly for these in- 
stances; the running time of S^*^", Sj*^", 84'^° and E4 is 3 to 500 times larger for every 
'heavy' instance than it is for the other instances of a similar size. Other LK variations 
are also affected, though, this mostly relates to the ones which use the 'optimistic' gain 
acceptance functions (Options|4]and|5]l, see Section [376l 

Our analysis has shown that all of these instances have an unusual weight distribution. 
In particular, all these instances have enormous number of 'heavy' edges, i.e., the the 
weights which are close to the maximum weight in the instance, prevail over the smaller 
weights. Recall that LK bases on the assumption that a randomly selected edge will 
probably have a 'good' weight. Then we can optimize a path in the hope to find a good 
option to close it up later. However, the probability to find a 'good' edge is low in a 
'heavy' instance. Hence, the termination condition GainlsAcceptable does not usually 
stop the search though a few tour improvements can be found. This, obviously, slows 
down the algorithm. 

Note that a similar result was obtained bv lKarapetvan and Gutin (l2010l) for the adap- 
tation of the Lin-Kernighan heuristic for the Multidimensional Assignment Problem. 

Observe that such 'unfortunate' instances can be easily detected before the algorithm's 
run. Observe also that even the fast heuristics yield relatively good solutions for these 
instances (see Tables H and |5]). Hence, one can use a lighter heuristic to get a reasonable 
solution quality in a reasonable time in this case. 



5. Conclusion 

The Lin-Kernighan heuristic is known to be a very successful TSP heuristic. In this 
paper we present a number of adaptations of Lin-Kernighan for the GTSP. Several ap- 
proaches to adaptation of a TSP local search for the GTSP are discussed and the best ones 
are selected and applied to the Lin-Kernighan heuristic. The experimental evaluation con- 
firms the success of these approaches and proves that the proposed adaptations reproduce 
the efficiency of the original TSP heuristic. 

Based on the experimental results, we selected the most successful Lin-Kernighan 
adaptations for different solution quality /running time requirements. Only for the very 
small running times (5 ms or less) and huge instances (132 clusters and more) our heuris- 
tic is outperformed by some very basic local searches just because none of our adaptations 
is able to proceed in this time. For the very large running times, the Lin-Kernighan adap- 
tations are outperformed by the state-of-the-art memetic algorithm which usually solves 
the problem to optimality. 

To implement the most powerful adaptation 'Exact', a new approach was proposed. 
Note that the same approach can be applied to many other TSP local searches. Comparing 
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to the previous results in the literature, the time complexity of exploration of the corre- 
sponding neighborhood is significantly reduced which makes this adaptation practical. 
Though it was often outperformed by either faster adaptations or the memetic algorithm 
in our experiments, it is clearly the best heuristic for small instances (up to 20 clusters in 
our experiments) and it is also naturally suitable for the asymmetric GTSP. 

Further research on adaptation of the Lin-Kernighan heuristic for other combinatorial 
optimization problems may be of interest. Our future plans also include a thorough study 
of different GTSP neighborhoods and their combinations. 
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Table 3: Details of experiment results for the small (10 to 29 clusters) instances. 



Solution en'or, % Running time, ms 



Instance 


Best 


20™ 




S2co 


Ei 


ma 


20™ 


q2co 


S2co 




ma 


10att48 


5394 


6.3 


0.0 


0.0 


0.0 


0.0 


0.24 


0.25 


0.28 


2.53 


18.72 


10gr48 


1834 


4.9 


0.0 


0.0 


0.0 


0.0 


0.01 


0.03 


0.06 


1.45 


12.48 


10hk48 


6386 


0.0 


0.3 


0.0 


0.0 


0.0 


0.01 


0.08 


0.17 


1.36 


18.72 


lleil51 


174 


4.0 


0.6 


0.0 


0.0 


0.0 


0.01 


0.04 


0.13 


1.23 


17.16 


llberlin52 


4040 


0.0 


0.0 


0.4 


0.0 


0.0 


0.01 


0.06 


0.12 


1.17 


12.48 


12brazil58 


15332 


2.1 


0.0 


0.0 


0.0 


0.0 


0.01 


0.06 


0.11 


1.67 


12.48 


14st70 


316 


6.3 


0.0 


0.3 


0.0 


0.0 


0.02 


0.07 


0.16 


3.18 


21.84 


16eil76 


209 


4.8 


0.0 


0.0 


0.0 


0.0 


0.01 


0.06 


0.23 


4.23 


21.84 


16pr76 


64925 


1.6 


1.4 


0.0 


0.0 


0.0 


0.02 


0.11 


0.27 


4.10 


26.52 


20gr96 


29440 


2.4 


1.0 


0.0 


0.0 


0.0 


0.03 


0.22 


0.42 


9.09 


28.08 


20rat99 


497 


7.8 


0.2 


0.0 


0.0 


0.0 


0.03 


0.24 


0.64 


15.05 


37.44 


20kroal00 


9711 


4.2 


5.8 


0.0 


0.0 


0.0 


0.03 


0.17 


0.46 


14.59 


31.20 


20krobl00 


10328 


0.0 


0.0 


0.0 


0.0 


0.0 


0.01 


0.10 


0.36 


15.64 


28.08 


20krocl00 


9554 


10.1 


0.1 


0.0 


0.0 


0.0 


0.03 


0.17 


0.54 


8.05 


31.20 


20krodl00 


9450 


1.5 


0.0 


2.0 


0.0 


0.0 


0.05 


0.17 


0.44 


10.12 


39.00 


20kroel00 


9523 


1.3 


4.4 


0.0 


0.0 


0.0 


0.03 


0.15 


0.37 


8.33 


31.20 


20rdl00 


3650 


7.1 


0.1 


0.9 


0.0 


0.0 


0.03 


0.15 


0.53 


18.02 


34.32 


21eill01 


249 


4.4 


0.4 


0.8 


0.4 


0.0 


0.02 


0.16 


0.30 


7.24 


43.68 


211inl05 


8213 


0.1 


0.0 


0.0 


0.0 


0.0 


0.02 


0.14 


0.36 


5.50 


32.76 


22prl07 


27898 


4.4 


0.0 


0.0 


0.0 


0.0 


0.01 


0.23 


0.42 


24.48 


31.20 


24grl20 


2769 


20.5 


2.8 


2.6 


0.0 


0.0 


0.03 


0.24 


0.77 


10.77 


43.68 


25prl24 


36605 


4.5 


0.0 


0.5 


0.0 


0.0 


0.05 


0.39 


0.81 


14.76 


46.80 


26bierl27 


72418 


6.9 


8.6 


0.0 


0.0 


0.0 


0.08 


0.36 


0.69 


12.45 


54.60 


26chl30 


2828 


12.1 


0.0 


0.0 


0.0 


0.0 


0.09 


0.24 


0.71 


18.14 


48.36 


28prl36 


42570 


9.7 


0.8 


0.0 


0.0 


0.0 


0.04 


0.49 


0.77 


14.24 


49.92 


28grl37 


36417 


1.9 


1.4 


1.3 


0.1 


0.0 


0.04 


0.27 


0.97 


62.66 


51.48 


29prl44 


45886 


4.0 


0.0 


0.0 


0.0 


0.0 


0.03 


0.36 


0.58 


15.31 


40.56 


Average 




4.9 


1.0 


0.3 


0.0 


0.0 


0.04 


0.19 


0.43 


11.31 


32.07 
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Table 4: Detailed experiment results for the moderate and large instances (m > 30). The 
reported values are relative solution errors, %. 



Instance 


Best 


20^^ 






S2co 


S3co 


Sf° 


g4co 


El 


ma 


jucnljU 


2750 


6.5 


7.1 


1.7 


1.1 


0.0 


0.0 


0.3 


1.1 


0.0 


■-)f\l 1 Cf\ 

JUKroal50 


1 1018 


16.2 


8.2 


0.1 


1.6 


0.0 


0.0 


0.0 


0.0 


0.0 


■-)r\i: u 1 cr\ 

jUKrobljO 


12196 


5.4 


5.4 


0.0 


1.0 


0.0 


0.5 


0.0 


0.0 


0.0 


31prl52 


51576 


4.1 


3.3 


3.9 


1.9 


0.0 


0.0 


0.0 


1.2 


0.0 


32ul59 


22664 


24.9 


10.2 


0.8 


0.4 


0.0 


0.0 


0.0 


1.1 


0.0 


35sil75 


5564 


2.6 


3.8 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


jobrgloU 


4420 


314.5 


314.5 


0.5 


78.3 


0.0 


0.0 


0.0 


0.0 


0.0 


jyratiyj 


854 


7.6 


12.5 


1.4 


2.0 


0.2 


1.3 


0.1 


0.0 


0.0 


A A J 1 no 

40(1196 


10557 


1.3 


3.5 


1.3 


0.3 


0.0 


0.5 


0.0 


0.2 


0.0 


4UKroa2UU 


13406 


8.3 


4.8 


0.6 


0.4 


0.4 


0.4 


0.4 


0.0 


0.0 


40Krob200 


13111 


14.6 


14.0 


0.1 


2.7 


0.2 


0.1 


0.0 


0.0 


0.0 


4 1 grzUz 


23301 


10.5 


7. 1 


3.1 


4.3 


2.5 


1.9 


A A 


0.0 


0.0 


45ts225 


68340 


7.2 


6.8 


0.1 


0.3 


0.3 


0.1 


0.3 


0.1 


0.0 


45tsp225 


1612 


12.3 


6.6 


0.6 


1.0 


1.9 


0.3 


0.3 


0.0 


0.0 


4opr22o 


64007 


14.2 


l.I 


1.1 


1.1 


0.0 


0.0 


0.0 


0.0 


0.0 


4ogr22y 


71972 


7.6 


8.1 


1.2 


1.0 


0.0 


0.0 


0.0 


0.9 


0.0 


53gil262 


1013 


20.6 


11.1 


3.6 


0.7 


0.8 


0.7 


0.0 


0.2 


0.0 


53pr264 


29549 


9.9 


0.7 


0.8 


0.8 


1.0 


0.4 


0.2 


0.5 


0.0 


56a280 


1079 


5.9 


3.3 


2.3 


0.8 


0.3 


0.3 


0.0 


0.6 


0.0 


oUpriyy 


22615 


8.0 


4.0 


3.9 


1 .0 


U.Z 


A n 
U.U 


0.0 


0. 1 


A A 


o41in-5 1 o 


20765 


10. 1 


8.3 


3.7 


2.5 


0.9 


0.0 


0.0 


2.6 


0.0 


S0ra4u0 


6361 


1 1.4 


7.9 


2.3 


1.3 


2.8 


1.1 


2.0 


0.7 


0.0 


O /I fl /I IT 


9651 


0.5 


1.5 


1.8 


1.6 


0.5 


0.0 


0.1 


0.0 


0.0 


87gr431 


101946 


5.1 


5.2 


2.6 


3.2 


3.6 


2.5 


1.1 


0.0 


0.0 


88pr439 


60099 


9.7 


5.9 


1.8 


1.2 


1.4 


1.3 


0.0 


1.1 


0.0 


89pcb442 


21657 


7.8 


5.5 


2.9 


0.1 


1.0 


0.0 


1.7 


2.1 


0.0 


99d493 


20023 


8.3 


5.8 


2.1 


3.3 


2.4 


0.7 


1.4 


2.3 


0.0 


107ali535 


128639 


15.9 


5.0 


3.1 


2.6 


0.5 


0.0 


0.4 


0.5 


0.0 


107att532 


13464 


11.3 


5.6 


0.8 


1.5 


0.5 


0.8 


0.1 


0.1 


0.0 


107si535 


13502 


2.4 


1.3 


0.3 


0.1 


0.1 


0.0 


0.0 


0.3 


0.0 


113pa561 


1038 


10.7 


6.3 


1.4 


2.9 


1.7 


1.6 


1.6 


0.6 


0.0 


115u574 


16689 


10.4 


9.5 


5.7 


5.1 


0.2 


1.1 


1.0 


1.6 


0.0 


115rat575 


2388 


13.4 


11.5 


4.4 


4.2 


3.5 


3.2 


3.2 


1.3 


0.2 


131p654 


27428 


2.0 


1.4 


0.3 


2.5 


0.2 


0.0 


0.0 


0.2 


0.0 


132d657 


22498 


10.6 


9.5 


4.6 


3.9 


1.7 


1.6 


0.5 


1.9 


0.1 


134gr666 


163028 


10.7 


5.7 


2.2 


2.4 


1.9 


2.5 


2.0 


1.0 


0.2 


145u724 


17272 


12.5 


13.1 


4.6 


2.3 


1.3 


2.9 


0.3 


1.3 


0.0 


157rat783 


3262 


19.7 


12.9 


4.7 


2.9 


3.5 


0.3 


1.6 


1.3 


0.1 


200dsjl()00 


9187884 


14.8 


8.9 


4.3 


4.4 


0.8 


1.5 


1.9 


2.6 


0.1 


201prl002 


114311 


16.3 


8.4 


3.6 


0.2 


0.2 


1.5 


0.8 


0.1 


0.2 


207silO32 


22306 


5.2 


4.1 


1.7 


1.2 


0.9 


0.1 


0.1 


0.9 


0.0 


212ul060 


106007 


13.7 


9.0 


3.6 


2.3 


1.8 


1.7 


2.1 


0.7 


0.2 


217vml084 


130704 


12.4 


8.2 


3.1 


3.0 


2.2 


2.0 


2.1 


1.8 


0.3 


Average 




17.1 


13.9 


2.2 


3.6 


1.0 


0.8 


0.6 


0.7 


0.0 


Light avg. 




11.4 


7.6 


2.5 


2.0 


1.1 


0.9 


0.7 


0.8 


0.0 


Heavy avg. 




42.3 


41.3 


0.8 


10.6 


0.3 


0.1 


0.1 


0.3 


0.0 
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Table 5: Detailed experiment results for the moderate and large instances (m > 30). The 
reported values are running times, ms. 



Instance 


20™ 




C2co 


S2co 




S4co 


S4co 


El 


ma 


JUcnl5u 


0.1 


0.1 


0.5 


1.4 


2.8 


2.7 


8.0 


46.7 


56.2 


iUkroaljU 


0.0 


0.1 


0.4 


1.0 


1.8 


4.0 


4.2 


32.3 


57.7 


iUkrobljO 


0.0 


0.1 


0.4 


1.2 


1.5 


2.5 


7.0 


50.6 


65.5 


31prl52 


0.0 


0.2 


0.4 


1.4 


4.5 


25.3 


33.4 


38.8 


39.0 


32ul59 


0.1 


0.1 


0.3 


0.9 


2.7 


4.2 


25.7 


31.9 


62.4 


35sil75 


0.1 


0.2 


1.8 


3.6 


10.0 


23.5 


358.8 


232.5 


64.0 


JobrgloU 


0.0 


0.2 


0.4 


0.4 


1.2 


2.3 


279.3 


46.4 


53.0 


39ratl95 


0.1 


0.1 


0.7 


1.2 


3.1 


7.3 


13.7 


64.9 


138.8 


Ar\A 1 no 

4Uaiyo 


0.2 


0.6 


2.0 


3.7 


21.9 


134.2 


310.4 


98.7 


126.4 


4UKroa2uU 


0.1 


0.1 


0.7 


1.6 


3.2 


A 1 

4.1 


1 1.7 


*;a a 
60. D 


123.2 


A r\i. „i_ir\r\ 

4UKrob2uU 


0.1 


0.2 


0.5 


1.4 


2.4 


4.2 


16.2 


56.3 


157.6 


A 1 r.^OAO 

4 1 grzUz 


A 1 

0. 1 


0.3 


0.8 


1.9 


1 
/.5 


1 1 A 


81.2 


86.1 


198.1 


45ts225 


0.1 


0.3 


0.7 


3.0 


8.0 


10.0 


19.9 


273.0 


191.9 


45tsp225 


0.1 


0.2 


0.8 


2.3 


3.1 


7.6 


15.5 


1 12.3 


156.0 


4opr22o 


0.1 


0.4 


1.0 


1.9 


4.5 


12.7 


21.7 


44.1 


95.2 


4ogr22y 


0.1 


0.2 


1.0 


3.2 


3.5 


8.8 


13.9 


145.1 


224.6 


53gil262 


0.2 


0.3 


1.9 


3.8 


7.9 


9.2 


21.3 


107.8 


290.2 


5jpr2o4 


0.2 


1.0 


5.7 


6.5 


66.2 


282.4 


505.4 


230.9 


204.4 


56a280 


0.2 


0.3 


1.1 


2.2 


1 1.2 


9.3 


43.9 


148.2 


291.7 


bupriyy 


0.1 


0.2 


1.5 


3.8 


8.7 


12.6 


31.4 


146.7 


347.9 


D41in J 1 


0.2 


0.3 


2.0 


4.2 


17.3 


48. 


81.4 


223.1 


404.0 


oUra40U 


0.3 


0.7 


3.8 


5.6 


18.2 


36.7 


74.4 


305.8 


872.0 


84n4 1 / 


0.3 


2.3 


5.9 


9.7 


59.0 


174.8 


315.1 


645.8 


583.4 


87gr431 


0.4 


0.8 


4.4 


9.3 


19.8 


59.6 


107.9 


485.2 


1673.9 


88pr439 


0.3 


0.8 


3.0 


11.6 


24.4 


54.3 


109.3 


764.4 


1146.6 


89pcb442 


0.5 


0.8 


4.1 


9.5 


23.1 


42.9 


88.8 


656.8 


1530.4 


99d493 


0.7 


2.0 


7.5 


13.1 


148.3 


2666.1 


1616.2 


591.2 


3675.4 


107ali535 


1.0 


2.3 


7.1 


13.4 


29.9 


52.2 


170.2 


795.6 


3558.4 


107att532 


0.6 


1.9 


8.0 


17.1 


33.1 


71.5 


312.1 


932.9 


2942.2 


107si535 


0.5 


5.5 


32.9 


46.7 


337.0 


1921.9 


12725.0 


3503.8 


1449.2 


113pa561 


0.7 


1.3 


5.4 


11.6 


28.6 


51.0 


104.2 


695.8 


2931.3 


115u574 


0.7 


1.9 


6.8 


10.7 


53.3 


63.9 


156.1 


956.3 


3017.1 


115rat575 


0.5 


1.3 


6.4 


17.9 


41.0 


92.9 


128.0 


697.3 


2867.3 


131p654 


1.2 


9.4 


40.9 


27.3 


213.9 


1074.8 


2964.0 


3293.2 


2137.2 


132d657 


0.9 


2.3 


13.6 


22.0 


109.4 


1009.3 


2322.9 


794.0 


4711.2 


134gr666 


1.0 


2.3 


8.7 


28.1 


51.5 


135.9 


374.4 


1425.8 


10698.6 


145u724 


1.0 


2.7 


13.4 


32.6 


62.8 


105.8 


242.0 


1326.0 


7952.9 


157rat783 


1.5 


2.2 


17.7 


30.8 


73.7 


131.3 


248.3 


2165.3 


9459.9 


200dsjl000 


3.5 


10.3 


80.7 


104.5 


592.8 


5199.5 


8032.5 


9361.6 


22704.4 


201prl002 


2.3 


6.2 


39.1 


57.0 


156.4 


290.6 


539.8 


2719.1 


21443.9 


207sil032 


3.5 


37.4 


839.3 


875.2 


7063.7 


195644.0 


306944.8 


112926.4 


17840.3 


212ul060 


3.7 


7.1 


36.4 


80.2 


195.5 


307.5 


1040.5 


2990.5 


31201.8 


217vml084 


2.5 


6.6 


51.4 


78.5 


204.8 


496.1 


978.1 


4687.8 


27587.2 


Average 


0.7 


2.6 


29.3 


36.3 


226.4 


4890.9 


7941.8 


3604.6 


4310.1 


Light avg. 


0.7 


1.6 


9.5 


16.8 


56.0 


315.7 


488.5 


972.0 


4653.6 


Heavy avg. 


0.8 


7.1 


116.1 


121.6 


971.6 


24907.3 


40550.4 


15122.2 


2807.2 



25 



